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We propose a novel notion of a quantum learning machine for automatically controlling quantum 
coherence and for developing quantum algorithms. A quantum learning machine can be trained to 
learn a certain task with no a priori knowledge on its algorithm. As an example, it is demonstrated 
that the quantum learning machine learns Deutsch's task and finds itself a quantum algorithm, that 
is different from but equivalent to the original one. 
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Quantum automatic control- Quantum information 
science (QIS) aims to exploit quantum mechanics to im- 
prove the acquisition, transmission and processing of in- 
formation. This field has seen explosive growth in re- 
cent years, stimulated by the applications such as quan- 
tum cryptography and quantum communication which 
have the potential to surpass their classical counter- 
parts. In particular, quantum computation which was 
originally proposed by Feynman [l[ , received its momen- 
tum after considerable speedup was found for some al- 
gorithms including the Deutsch-Josza @, 0|, Shor's fac- 
torization 0, Grover's database search p, and hid- 
den subgroup problem algorithms. One of the important 
challenges in quantum computation is to find algorithms 
which can fully explore quantum parallelism for speedup. 

For the success of QIS, developing new quantum algo- 
rithms and enhancing the controllability of quantum co- 
herence and the ability for quantum-state engineering Q 
are important agendas and deserve novel approaches. 
Here, we propose a new method of quantum control, 
where quantum-state engineering is "automatically" im- 
plemented by using a feed-back method to adjust unitary 
operations so to eventually bring about a target state 
intended. The feed-back system performs a single-shot 
quantum measurement on the output system and figures 
out if it is in the target state. If not, it modifies the 
control parameters of unitary operation. The feed-back 
system repeats over an ensemble of given quantum sys- 
tems, one by one, until the adjusted unitary operation 
outputs the target state. 

In fact, various feed-back systems have been studied for 
classical and quantum automatic controls which include 
the quantum neural network Q, quantum-state estima- 
tion [9( and automatic engineering of wave packets for 
molecules or monochrome light fields with a genetic algo- 
rithm 

0,1 

Til . [l2| . Our approach of quantum automatic 
control contrasts with the aforementioned methods as we 
adopt the fundamentals of QIS. In this scheme, infor- 
mation processing tasks are performed by automatically 
adjusting the parameters of internal unitary operation. 
Hence we call the entire system including the unitary 
operation device of the quantum system, the single-shot 



measurement, and the feedback system as the "quantum 
learning machine (QLeM)." 

In this Letter, we investigate the possibility of the 
QLeM to develop a quantum algorithm. As an example 
for a deterministic quantum algorithm, we demonstrate 
that the QLeM learns Deutsch's task and always finds 
itself a quantum algorithm, which is different from but 
equivalent to Deutsch's algorithm. 

Quantum learning machine.- Given a task by a supervi- 
sor, the QLeM learns by itself how to perform the task. A 
task is represented by a function f(x) where x is an input 
and t x = f(x) is the target. Here, we should clarify that x 
and t are classical numbers and our QLeM takes classical 
numbers as input values and outputs classical determin- 
istic values. However, the internal operations before the 
measurement are all unitary, which will make sure the 
advantage of quantum parallelism. For the function /, 
the supervisor selects a set of K input-target pairs, T = 
{(xi, f(xx)), (x 2 , f(x 2 )), ■ ■■ , (xk, f{x K )}, and sends the 
set T to the QLeM through a classical channel. The ma- 
chine is supposed to learn and to perform the task, now 
represented by the set T. 

In order to perform quantum information processing, 
the QLeM contains a preparation device P to prepare 
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FIG. 1: Architecture of a quantum learning machine (QLeM), 
composed of a preparation device P, a unitary operation U, a 
single-shot quantum measurement M, and a feed-back system 
F equipped with a classical memory storage S. 
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the quantum system Q to be in a certain initial state, 
an operation device U performing a unitary operation 
on Q, and a measurement device M. The QLeM is to 
provide the basic building blocks, P-U-M, of informa- 
tion processing with a feed-back system F, so that F 
can adjust the control parameters of U, depending on 
the measurement outcome in M. For this purpose the 
feed-back system F is equipped with a classical memory 
storage S, which stores the parameter values of U and 
records the measurement outcomes at M, and two classi- 
cal channels Cfu an d Cmf, where Cfu (Cmf) enables 
one-way communication from F to U (from M to F). 
Fig. Q] presents the schematic diagram of the QLeM. 

The feed-back system F is responsible for QLeM's 
learning and eventually performing the correct task: It 
controls the operation device U to eventually bring out 
the target value for a given input in the set of task T. The 
feed-back system F determines if the target t has been 
obtained, as monitoring the outcome m from the mea- 
surement device M. The outcome is transferred from M 
to F through the classical channel Cmf- 

An input value x can be encoded either in the prepara- 
tion P or operation U device. In most cases, encoding in 
U is appropriate and this is the case for finding Deutsch's 
algorithm as shown later. In order to incorporate the en- 
coding process, U is decomposed into three sub devices 
Ui, U2 and U3, of which the middle one U2 performs an 
operation according to the input value x. 

The QLeM runs in an iterative way as checking if it 
always works for the given task T. At the first itera- 
tion, the feed-back system F selects randomly an element 
(x,t x ) G T. It prepares the control parameters in the 
operation device U. The action of the sub-device U2 is 
determined by the input value x. Thus F dials the pre- 
determined value of the parameter P2(x) for 1/2- Then it 
chooses arbitrary values of parameters pi and P3 for U\ 
and C/3, respectively. A typical set of such parameters is 
represented by a unitary operator, 



U(p) 



(1) 



where G is a vector of SU(d) group generators with d as 
the dimension of Hilbert space and p is called a coher- 
ent or generalized Bloch vector [l3| • The arbitrariness in 
choosing parameter values is crucial in our approach of 
the QLeM, as this implies the machine does not require 
any a priori knowledge about the algorithm (Note: pre- 
determining the parameters in U2 is a part of defining the 
task, not a part of the algorithm). Let us assume that 
the initial state prepared by P for the quantum system 
Q is |0) . After going through U, Q becomes to be in the 
output state, 

IVx) - U x \0) = L>3(P3)^2(P2(x))^l(Pl)|0). (2) 

The device M measures Q in the standard basis {|m)} 
and its outcome m is sent to F through Cmf- If m is 



equal to the target t x , F records "success", say bit '1', in 
the classical memory storage S and, otherwise, it records 
"failure", bit '0', in S. 

At every iteration, the QLeM repeats the preparation 
process to re-initialize the quantum state. Then, the 
feedback system F adjusts the control parameters pi t 3 
in the operation device U . The classical memory stor- 
age S keeps the record of success/failure. If it is fully 
occupied, S eliminates the oldest record and shifts each 
record to the next cell, as seen in Fig. [2] If the storage 
is all filled by success, F terminates the learning process, 
which is called the halt condition, and announces the val- 
ues of pi,3 to the supervisor. The memory size, denoted 
by N, decides the precision of the QLeM. We now have a 
learning probability P(n) which denotes the probability 
that the QLeM completes the learning process before or 
at the nth iteration. We can also define a survival proba- 
bility Q(n) = 1 — P(n) as the probability that the QLeM 
does not complete until n (the term survival probability 
is from the theory of random walks with a trap) . 

A learning algorithm tells the feedback system F how 
to update the parameter values pi^ in the operation 
device U and when to halt the learning process before 
announcing pi 3 to the supervisor. Our design of the 
learning algorithm is as follows. At the reception of the 
parameter vector p( n ) = (pj , p|™ ) for the nth itera- 
tion, (and also P2(x) for the random input value x), U 
performs the corresponding unitary operation and results 
in an output state \if> x ), as in Eq. ((2|). Measuring \ip x ), 
M judges if the measurement outcome is the same as 
the target t x and it sends the result to F. F records 
the outcome in S as described earlier. If the operation 
was successful, F trusts the parameter values and leaves 
them unchanged: p( n+1 ) = p( n ). Otherwise, F needs 
to modify the parameter values. Instead of using any a 
priori knowledge on the algorithm, F generates another 
random vector r and adjusts the parameter vector as 



p (n + X) = p (n) + ^ 



(3) 



where Nq and Ni are respectively the numbers of failure 
and success events so far, and Nt = min(iV, Ni + No). 
Our learning algorithm is intuitively understandable: the 
more the number of failure events, the more the adjust- 
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FIG. 2: The scheme for updating the records in the classical 
memory storage S when S is fully occupied. Here, '1' and 
'0' denote success and failure, respectively. S records sequen- 
tially each measurement outcome. If it is fully occupied, the 
oldest data is deleted. Then the remaining data are shifted by 
one cell into the next position and the newly emptied memory 
cell is filled up with the new data. 
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ment is imposed to the parameter, p remains invariant 
if all events were successful. Note that the oldest records 
in S will be eliminated as the learning process continues, 
keeping F on using the latest records for the adjustment. 

It is worth noting that the QLeM completes the learn- 
ing process much more efficiently than the case of the 
choice of parameters randomly without an access to the 
memory of success or failure [14j. The QLeM with this 
learning algorithm can be modeled by a random walk 
where its survival probability becomes an exponential 
function in the form of e( n_1 "™ c , w ith a characteristic 
constant n c . Once the QLeM completes the learning 
process for the given task, it transmits the parameter 
values to the supervisor. Then, the supervisor analyzes 
them, decomposes the unitary operations into a sequence 
of universal gates @, EH , and compares the sequence to a 
classical one. If it works better than its classical counter- 
part, the sequence is a quantum algorithm for the task. 

Example: Finding Deutsch's algorithm.- To investigate 
the possibility of QLeM for developing new quantum al- 
gorithms, we consider a QLeM for Deutsch's problem 
of judging if a function is balanced or constant. The 
QLeM will be shown to find optimal algorithms, possibly 
different from but equivalent to Deutsch's original algo- 
rithm. For the purpose we employ a quantum Monte- 
Carlo method to numerically simulate an experiment. 

Deutsch's problem is to decide if an arbitrary binary 
function is constant or balanced [l6| . Consider a function 
x with the domain and the image both being the binary 
set {0, 1}. There are four possible functions, Xi\ 



z (0) = 0, a:o(l) = 0; 
x 2 (0) = 1, x 2 {\) = 0; 



zi(0) = 0, :ri(l) = 1; 
x 3 (0) = 1, x 3 (l) = 1. 



(4) 



If Xi(0) = Xi(l) as in xq j3 , the function Xi is said to be 
constant. Otherwise, the function is balanced (as in 21,2)- 
The classical algorithm is simple: Obtaining the values 
y = x(k) for k = 0, 1, it judges if x(0) — x(l). Such an 
algorithm requires two queries of x for the both values 
of k. On the other hand, Deutsch's algorithm enables 
the judgement of x only by a single query, as it uses a 
quantum superposition of k = and 1 . 

The quantum circuit for Deutsch's algorithm is pre- 
sented in Fig. [3J In the circuit, H is the Hadamard 



gate which transforms |0) or |1) to a quantum super- 
position H\0) = (|0) + |1))/V2 or H\l) = (|0) - |1))/V2, 
respectively. The operation device Uf is a gate to calcu- 
late a given function Xi, which transforms \ki,k 2 )AB to 
|fci, k 2 © Xi{ki))AB- After going through the gates in the 
circuit, the qubits A and B are in the state 



\i>s) = 



±10). 

±li). 



(|0)-|l))i 
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(|0)-|l))i 
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if Xi is constant, 
if Xi is balanced. 



(5) 



The outcome at the measurement device M tells us if 
Xi is constant or balanced. It needs to detect a single 
qubit in the standard basis {|0), |1)}. The efficiency of 
the quantum algorithm is dramatically improved by en- 
larging the domain of a function, as in Deutsch-Jozsa 
algorithm [1, Q . 

We consider the QLeM that learns Deutsch's task and 
finds by itself an optimal internal operation. Deutsch's 
task is represented by a set, 



T : 



{(x ,f(x ) 

(X2,f{x 2 ) 



c), {xx,f(xx) 
b), (x 3 , f{x 3 ) 



b), 
= c)}, 



(6) 



where the input Xi is defined in Eq. ([4]) and 'c' and 'b' 
stand for constant and balanced, respectively. 

The QLeM is schematically presented in Fig. |U The 
preparation device prepares two qubits A and B to be in 
certain fixed states, say |0) and |1), respectively. In or- 
der to maximize the quantum parallelism, the number of 
input qubits has been chosen to be 2, for k can take two 
values and 1 in Xi(k). The choice of |0) and |1) for A 
and B can be random. They may be chosen differently as 
far as they are fixed throughout the learning process. The 
middle sub-device U 2 is placed to calculate the function 
of a given input Xi{k). The first sub-device U\ performs 
a two-qubit unitary operations. On the other hand, U 3 
does a single-qubit unitary operation before the measure- 
ment. The single qubit measurement has been chosen as 
there is only one bit of information, c and b, for the tar- 
get value. U has 18 control parameters (HI; 4 2 — 1 = 15 
for two-qubit operation U\ and 2 2 — 1 = 3 for one-qubit 
operation U 3 . The sub-device U 2 is a part of defining the 
task and the parameter values are predetermined with 
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FIG. 3: Quantum circuit for Deutsch's algorithm, where H is 
Hadamard operator and Uf is an operator for a given function 
Xi which results in the output state \ki, ffi Xj(fci)) when 
applied on the initial state jfajfe). 



FIG. 4: Architecture of the QLeM for Deutsch's task, where 
the unitary operation U consists of three sub-operations (see 
the text). Here Ui and C/3 are two-qubit and single-qubit 
operators, respectively. 
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FIG. 5: (a) Learning probability P(n) and (b) survival prob- 
ability Q(n) = 1 — P(n) in terms of the number of iterations 
n, constructed by quantum Monte-Carlo method with 1000 
trials for 300 bits of the classical memory storage. Solid line 
is a fitting function. The survival probability is well fitted to 
Q(n) = e~^ n ~ 1 ^ n " with a fitting parameter n c . 
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FIG. 6: a) Fidelity (T) averaged over 1000 trials for a given 
memory size N . Each error bar is a standard deviation over 
the trials, (b) Characteristic constant n c as a function of N 
on a log-log scale. The data are well fitted to n c = AN D with 
D ~ 3.46 and A ~ 10" 106 . 



respect to the input function Xi , such that U% transforms 
an input state \k\, k2)AB to |fci, k% © x%(k\))AB- 

We performs the quantum Monte-Carlo simulation, 
sampling 1000 trials, where the QLeM resets all the pa- 
rameters for each trial. In Fig.[5fa), we present a learning 
probability P(n), constructed by 1000 trials with 300 bits 
of the classical memory employed. The survival proba- 
bility Q(n) = 1 - P(n) in Fig. Mjo) is well fitted to an 
exponential function, e^™ -1 -*/" 1 with the characteristic 
constant n c , which characterizes how many iterations are 
necessary for the completion of learning. Note that n c is 
finite and smaller than that for the case of learning algo- 
rithm without a memory facility [l4j. The QLeM is able 
to learn how to perform Deutsch's task in a finite number 
of iterations, which is witnessed by the exponential decay 
of the survival probability with regard to the number of 
iterations. We have also noted that each unitary opera- 
tor Ui after completing the learning process is a product 
of two one-qubit operators within the precision allowed. 
The two one-qubit operators are in general not equal to 
the two Hadamard operators as in Deutsch's algorithm. 
Nevertheless, the internal operations, U% and U3 perform 
Deutsch's task so that the quantum algorithms identified 
by the QLeM are equivalent to the Deutsch's original one 
as far as all the one-qubit operations cost the same. 

We calculate the fidelity to find out how close the quan- 
tum state before the measurement is to the target state 
\t x ) when the QLeM completes the learning. In Fig.[5£a), 



we present the average fidelity with respect to a memory 
size N. Here the fidelity T — ^2 x —o(t x \Px\t x ) /4 is av- 
eraged over 1000 trials for a given N, where p x is the 
density operator for the QLeM output state. The aver- 
age fidelity T approaches unity as the memory size N 
increases, as we expected. For instance, T is as large as 
0.989 for N = 300. Fig. fflb) presents the characteris- 
tic constant n c as a function of N on a log-log scale. It 
is found that the characteristic constant n c is well fit- 
ted to a polynomial function of N, i.e., n c = AN D with 
D ~ 3.46 and A ~ 10~ 106 . 

In summary, we have proposed a novel notion of the 
QLeM for automatically controlling quantum coherence 
to find a quantum algorithm by itself. We have illus- 
trated, for Deutsch's task, how the QLeM finds a quan- 
tum algorithm and shown that the algorithms identified 
by the QLeM are equivalent to Deutsch's algorithm us- 
ing the fidelity which is ~ 1 for a finite number of itera- 
tions. This will open a new field of research to find a new 
quantum algorithm and further studies are necessary to 
improve the learning algorithm. 

We thank Prof. C. Brukner for discussions. We ac- 
knowledge financial support from Korean Research Foun- 
dation Grant funded by the Korean Government (KRF- 
2005-041-c00197), UK EPSRC and QIP IRC. 

Note added.- As completing this work, we recently 
found a related work, which considers probabilistic 
quantum learning for database search and factorization 
tasks (Tt I . 
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